The IMGT Strategy For The Automatic Annotation of IG And TR cDNA Sequences: IMGT/Automat

نویسندگان

  • Véronique Giudicelli
  • Céline Protat
  • Marie-Paule Lefranc
چکیده

IMGT, the international ImMunoGeneTics information system (http://imgt.cines.fr) [1] created in 1989, by the Laboratoire d'ImmunoGénétique Moléculaire (LIGM), at the Université Montpellier II, CNRS, Montpellier, France, is a high quality integrated information system, specializing in immunoglobulins (IG), T cell receptors (TR), major histocompatibility complex (MHC) and related proteins of the immune system (RPI) of human and other vertebrates. IMGT/LIGM-DB, the first and the largest IMGT sequence database, includes 69,616 nucleotidic sequences of 105 species in May 2003. We developed IMGT/Automat, an integrated IMGT Java tool, to automatically perform the annotation of the rearranged cDNA sequences which represent the half of the IMGT/LIGM-DB content. The annotation procedure includes the IDENTIFICATION of the sequences, the CLASSIFICATION of the IG and TR genes and alleles, and the DESCRIPTION of all IG and TR specific and constitutive motifs within the nucleotidic sequences, according to the IDENTIFICATION (standardized keywords), CLASSIFICATION (gene nomenclature), DESCRIPTION (standardized labels) and NUMEROTATION (IMGT unique numbering) concepts of IMGT/ONTOLOGY [2]. IMGT/Automat performs these tasks with the help of two available IMGT on-line tools: IMGT/V-QUEST (http://imgt.cines.fr) for the gene and allele identification and delimitations, and IMGT/JunctionAnalysis (http://imgt.cines.fr) for a detailed analysis of the junction in rearranged sequences. Because IMGT focuses on the quality of expertly annotated IG and TR sequences, we were aware that a such annotation tool must be as reliable and accurate as a human annotator is. Accuracy and reliability of the annotation are mainly estimated by the programme it-self with the evaluation of: the IMGT/V-QUEST alignment scores, the deduced sequence functionality, and the coherence of the characterized and delimited IG and TR motifs. IMGT/Automat is currently used by the IMGT team. It has performed the annotation of 7418 cDNA IG and TR sequences in May 2003.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Immunogenetics Sequence Annotation: the Strategy of IMGT based on IMGT-ONTOLOGY

IMGT, the international ImMunoGeneTics information system((R))(http://imgt.cines.fr) created in 1989, by the Laboratoire d'ImmunoGénétique Moléculaire (LIGM), Université Montpellier II and CNRS, Montpellier, France, is a high quality integrated information system, secialized in immunoglobulins (IG), T cell receptors (TR), major histocompatibility complex of human and other vertebrates and relat...

متن کامل

IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis

IMGT/V-QUEST is the highly customized and integrated system for the standardized analysis of the immunoglobulin (IG) and T cell receptor (TR) rearranged nucleotide sequences. IMGT/V-QUEST identifies the variable (V), diversity (D) and joining (J) genes and alleles by alignment with the germline IG and TR gene and allele sequences of the IMGT reference directory. New functionalities were added t...

متن کامل

IMGT/JunctionAnalysis: the first tool for the analysis of the immunoglobulin and T cell receptor complex V-J and V-D-J JUNCTIONs

MOTIVATION To create the enormous diversity of 10(12) immunoglobulins (IG) and T cell receptors (TR) per individual, very complex mechanisms occur at the DNA level: the combinatorial diversity results from the junction of the variable (V), diversity (D) and joining (J) genes; the N-diversity represents the addition at random of nucleotides not encoded in the genome; and somatic hypermutations o...

متن کامل

IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes

IMGT/GENE-DB is the comprehensive IMGT genome database for immunoglobulin (IG) and T cell receptor (TR) genes from human and mouse, and, in development, from other vertebrates. IMGT/GENE-DB is the international reference for the IG and TR gene nomenclature and works in close collaboration with the HUGO Nomenclature Committee, Mouse Genome Database and genome committees for other species. IMGT/G...

متن کامل

IMGT/LIGM-DB, the IMGT® comprehensive database of immunoglobulin and T cell receptor nucleotide sequences

IMGT/LIGM-DB is the IMGT comprehensive database of immunoglobulin (IG) and T cell receptor (TR) nucleotide sequences from human and other vertebrate species. It was created in 1989 by LIGM, Montpellier, France and is the oldest and the largest database of IMGT. IMGT/LIGM-DB includes all germline (non-rearranged) and rearranged IG and TR genomic DNA (gDNA) and complementary DNA (cDNA) sequences ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003